Probing an optimal class distribution for enhancing prediction and feature characterization of plant virus-encoded RNA-silencing suppressors.

Authors

  • Abhigyan Nath
  • Karthikeyan Subbiah
Abstract

To counter the host RNA silencing defense mechanism, many plant viruses encode RNA silencing suppressor proteins. These proteins share very low sequence and structural similarity, which hampers their annotation by sequence similarity-based search methods. Machine learning-based methods are a suitable alternative, but their performance is affected by factors such as class imbalance, incomplete learning, and the selection of inappropriate features. In this paper, we propose a novel approach to the class imbalance problem: finding the optimal class distribution to enhance prediction accuracy for RNA silencing suppressors. The optimal class distribution was obtained by applying different resampling techniques at varying degrees of class distribution, from the natural distribution to the ideal, i.e., equal, distribution. The experimental results support the view that the optimal class distribution plays an important role in achieving near-perfect learning. The best prediction results were obtained with the Sequential Minimal Optimization (SMO) learning algorithm: a sensitivity of 98.5%, a specificity of 92.6%, and an overall accuracy of 95.3% on tenfold cross-validation, further validated with a leave-one-out cross-validation test. Machine learning models trained on training sets oversampled with the synthetic minority oversampling technique (SMOTE) performed relatively better than those trained on either randomly undersampled or imbalanced training sets. Further, we characterized the important discriminatory sequence features of RNA silencing suppressors that distinguish these proteins from other protein families.
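The idea of sweeping the class distribution from the natural ratio toward an equal one can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`smote_like`, `resample_to_ratio`, `sensitivity_specificity_accuracy`), the neighbour count `k`, and the toy two-dimensional points are all assumptions made for illustration, and the interpolation step is only a simplified, SMOTE-style oversampler written with the Python standard library.

```python
import math
import random


def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority neighbours (SMOTE-style).
    Assumes at least two minority samples (hypothetical helper)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def resample_to_ratio(minority, majority, ratio, k=3, seed=0):
    """Oversample the minority class until minority/majority reaches `ratio`.
    ratio = 1.0 corresponds to the equal ("ideal") distribution."""
    target = math.ceil(ratio * len(majority))
    n_new = max(0, target - len(minority))
    return minority + smote_like(minority, n_new, k=k, seed=seed)


def sensitivity_specificity_accuracy(tp, fn, tn, fp):
    """Standard metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


if __name__ == "__main__":
    # Toy data (illustrative only): 4 minority points, 10 majority points.
    minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    majority = [[5.0 + i, 5.0] for i in range(10)]
    # Sweep from the natural ratio (0.4) toward the equal distribution (1.0).
    for ratio in (0.4, 0.6, 0.8, 1.0):
        resampled = resample_to_ratio(minority, majority, ratio)
        print(ratio, len(resampled), len(majority))
```

In a study of this kind, each resampled training set would be used to train a classifier and scored by cross-validation, and the ratio giving the best sensitivity, specificity, and accuracy would be taken as the optimal class distribution. Resampling should be applied to the training folds only, so that synthetic points do not leak into the evaluation data.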

Similar resources

Supervised Learning Classification Models for Prediction of Plant Virus Encoded RNA Silencing Suppressors

Viral encoded RNA silencing suppressor proteins interfere with the host RNA silencing machinery, facilitating viral infection by evading host immunity. In plant hosts, the viral proteins have several basic science implications and biotechnology applications. However in silico identification of these proteins is limited by their high sequence diversity. In this study we developed supervised lear...

Suppressors of RNA silencing encoded by plant viruses and their role in viral infections.

RNA silencing as a robust host defense mechanism against plant viruses is generally countered by virus-encoded silencing suppressors. This strategy is now increasingly recognized to be used by animal viruses as well. We present here an overview of the common features shared by some of the better studied plant viral silencing suppressors. We then briefly describe the characteristics of the few r...

Probing the microRNA and small interfering RNA pathways with virus-encoded suppressors of RNA silencing.

In plants, small interfering RNAs (siRNAs) and microRNAs (miRNAs) are effectors of RNA silencing, a process involved in defense through RNA interference (RNAi) and in development. Plant viruses are natural targets of RNA silencing, and as a counterdefensive strategy, they have evolved highly diverse silencing suppressor proteins. Although viral suppressors are usually thought to act at distinct...

Three distinct suppressors of RNA silencing encoded by a 20-kb viral RNA genome.

Viral infection in both plant and invertebrate hosts requires a virus-encoded function to block the RNA silencing antiviral defense. Here, we report the identification and characterization of three distinct suppressors of RNA silencing encoded by the approximately 20-kb plus-strand RNA genome of citrus tristeza virus (CTV). When introduced by genetic crosses into plants carrying a silencing tra...

Journal title:
  • 3 Biotech

Volume 6, Issue 1

Pages: -

Publication date: 2016